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Granular computing is a computational paradigm that mimics human cognition in terms of grouping 
similar information together. Compatibility operators such as cardinality, orientation, density, and mul- 
tidimensional length act on both in raw data and information granules which are formed from raw data 
providing a framework for human-like information processing where information granulation is intrin- 
sic. Granular computing, as a computational concept, is not new, however it is only relatively recent when 
this concept has been formalised computationally via the use of Computational Intelligence methods such 
as Fuzzy Logic and Rough Sets. Neutrosophy is a unifying field in logics that extents the concept of fuzzy 
sets into a three-valued logic that uses an indeterminacy value, and it is the basis of neutrosophic logic, 
neutrosophic probability, neutrosophic statistics and interval valued neutrosophic theory. In this paper 
we present a new framework for creating Granular Computing Neural-Fuzzy modelling structures via the 
use of Neutrosophic Logic to address the issue of uncertainty during the data granulation process. The 
theoretical and computational aspects of the approach are presented and discussed in this paper, as well 
as a case study using real industrial data. The case study under investigation is the predictive modelling 
of the Charpy Toughness of heat-treated steel; a process that exhibits very high uncertainty in the mea- 
surements due to the thermomechanical complexity of the Charpy test itself. The results show that the 
proposed approach leads to more meaningful and simpler granular models, with a better generalisation 
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performance as compared to other recent modelling attempts on the same data set. 
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1. Introduction 


Extracting information and converting it to ‘easy to interpret’ 
knowledge is a very important but not a trivial task in Systems Engi- 
neering, in particular in the case of very complex and non-linear 
processes [1]. Within this context, Soft Computing techniques can 
be utilised to offer their transparency and interpretability potential. 
Transparency plays a significant role as a measure of interpretabil- 
ity and distinguishability, i.e. the more interpretable information of 
a system under study, the better its understanding. Unlike popular 
clustering approaches such as Fuzzy C-Means, Granular Comput- 
ing (GrC) [2,3] groups data not only based on similar mathematical 
properties such as proximity but also it considers the raw data as 
conceptual entities that are captured in a compact and transpar- 
ent manner [4]. Therefore, such individual entities are merged into 
dense information granules whose similarity [3] can be evaluated 
in a variety of ways depending on the application at hand. In GrC all 
operators act on the information granules and raw data, which can 
embed useful (for the data mining process) granular knowledge 
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such as the proximity to other information granules, cardinality, 
density, function similarity, orientation, overlap, etc. 

In literature a number of granular frameworks appeared [5-8] 
via the use of Rough Sets, Fuzzy Logic and Neural Networks. 

Even though granulation process [8] groups similar entities, 
there is not any measure which leads how much a granule must 
grow. This phenomenon produces a grade of inclusion uncertainty 
among the new granules as a consequence of a ravenous behaviour. 
Usually, in fuzzy systems a parsimony model is related to its inter- 
pretability as a consequence of a good distinguishability. However, 
as it is mentioned in [9] when adaptive learning algorithm is 
introduced into a fuzzy inference system, there might be a loss in 
the interpretability. We believe a distinguishable and initial gran- 
ular framework can aid in the estimation of the fuzzy inference 
parameters. To exemplify a case study model used in this paper 
is based on a real industry dataset related to the measurement 
of Charpy Toughness in heat-treatment steel and it suffers from 
sparsely data set [10,11]. The use of Neutrosophic domain pro- 
vides an extra dimension which measures the entropy produced by 
the creation of a new granule. And it persuades the compatibility 
search in eliminating potential granules to be merged. In this paper 
we present a systematic approach for the construction of granular 
objects by means of a Neural-Fuzzy modelling structure based on 
Radial Basis Functions (RBF) and Neutrosophy. A Neural Fuzzy sys- 
tem combines the learning capabilities of Neural-Networks (NN) 
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and the advantages of system transparency as offered by Fuzzy- 
Logic (FL) systems. Furthermore, a large number of NF approaches 
have been proposed in the past, just to name a few [12-18]. 

In spite of the functional equivalence between RBF neural 
networks and fuzzy systems (FS) [12,19], fuzzy capabilities such as 
transparency and interpretability loss their real power as a conse- 
quence of black-box properties in RBF neural structures. Therefore, 
one of the major objectives in Soft Computing (SC) systems and 
modelling, the development of transparent knowledge and rule- 
bases is crucial for the interpretation of real world systems. The 
motivation behind the research work presented in this paper is 
to propose a systematic methodology by using granulation that is 
capable of identifying the data uncertainty produced by merging 
two different information granules in a data space. In particular, 
those granules which are quite similar, and how their orientation 
can be influenced in order to produce better distinguishability in 
the creation of fuzzy partitions. 

In this paper, the hesitation produced during information gran- 
ulation particularly due to the lack of distinguishability in the initial 
fuzzy partition is measured by the use of a neutrosophic index [20], 
which is a generalization of the Intuitionistic Fuzzy Sets [21]. In 
intuititionistic theory the uncertainty of an element A is produced 
by t7a=Matva <1, and a: X> [0,1] and vg: X= [0,1]. In neutro- 
sophic sets the tuple <t, i, f> represents the truth, falsehood, and 
indeterminacy, with the latter often used in literature as uncer- 
tainty, hesitation, ignorance, etc. From a mathematical point of 
view, neutrosophic sets and the set-theoretic operators should be 
defined according to the problem at hand. In this paper we define 
the set-theoretic operators based on the entropy (uncertainty) pro- 
duced by the distribution of membership functions in the initial 
clustering stage. The neutrosophic components <t, i, f> define the 
domain in which an element belongs (t=true=membership) or 
not (f=false=non-membership) to a certain fuzzy set and if the 
compatibility criterion in granulation produces a high indetermi- 
nate value (uncertainty). In Fuzzy Logic theory, entropy has been 
employed as a measure of Fuzzy information about a Fuzzy set or 
system in the universe of discourse. Shannon considered the loga- 
rithmic behaviour of the entropy due to its addition property [22], 
and as an information gain from an event that is inversely related 
to its occurrence [23]. 

For example in [24] an ensemble NN was presented to predict 
mineral prospectivity into deposit or barren cell. In that work, two 
Neural Networks were employed to estimate the degree of truth 
and false membership values, and finally calculate the degree of 
indeterminacy by a using an interpolation method. In this paper, 
the Neutrosophic scheme is utilised to influence the compatibility 
measure between granules by evaluating the uncertainty produced 
by the merging of the information granules. This paper is organised 
as follows: in the next section, Section 2, a systematic modelling 
approach is presented based on Granular Computing and an RBF 
Neural-Fuzzy structure. Section 3 presents a Neutrosophic scheme 
based on a version of Shannon’s entropy definition for the formula- 
tion of uncertainty to account for any uncertainty produced during 
the merging of the information granules. In Section 4, a case study 
is presented based on a real industry dataset related to the mea- 
surement of Charpy Toughness in heat-treated steel. This process is 
known in the steel industry for the uncertainty induced in the mea- 
surements due to complex thermomechanical phenomena during 
the mechanical testing of the material. Finally, Section 5 concludes 
this paper and suggests possible future research directions. 


2. Granular computing and RBF neural-fuzzy networks 


Granular computing is a computational paradigm that mimics 
the cognitive human abstraction in order to group together entities 


with similar features, i.e. volume, density, geometrical proper- 
ties, cardinality, function, overlap, etc. To achieve the information 
grouping, granulation employs a criterion measure that calculates a 
‘compatibility index’ based on granular similarity. In essence, gran- 
ulation is an iterative process, which consists of two main steps. In 
this paper, we will extend a method based on the iterative approach 
shown in [4,8]: 


e Find the two most compatible information granules and merge 
them together as a new information granule containing both orig- 
inal granules. 

© Repeat the process of finding the two most compatible granules 
until a satisfactory data abstraction level is achieved. 


The compatibility index C(A,B), defined as the merging two dif- 
ferent granules A and B is based on various geometrical properties 
and the number of elements that form each granule. The compat- 
ibility defines the most important concept during the granulation 
process. 


C(A, B) = Distanceyax — Distance, pg - exp(—a@ x R) (1) 
in which 


Cap /Cardinalityyjax 
La.p/Lengthyax 


Distanceyax =Maximum possible distance in the data set, 
Distance, g =the multidimensional average distance. 
Between two granules A, B such as: 


d 
Dain i(Os — D2) (2) 


Distance, p = 
In which 


D, = max (max,j, Maxg;) 


D2 = min(min,;, Ming;) 


w;,: the importance weight for dimension i; d: the number of dimen- 
sions; max ,;: Max limit of granule ‘X’ in dimension ‘i’; miny;: min 
limit of granule ‘x’ in dimension ‘i’; a: weights the requirements 
between distance and cardinality/length [8]; Cardinality)jax: the 
maximum possible cardinality in the data set; Lengthyax: the 
maximum possible length of a granule in the data set; C4 g: the car- 
dinality of the resulting granules; L, g: the multidimensional length 
of the resulting granule, where maxy; and miny; are the maximum 
and minimum length respectively of the resulting granule at each 
dimension i. 


d 


Lae = S_ (max; — miny) (3) 
i=1 


To exemplify the compatibility calculation C(A,B) [4], in Fig. 1 is 
given a 2-dimensional granular space where the granule A and B are 
merged. The term a is used as a threshold value in the interval [0-1] 
to balance the two compatibility measures of ‘distance’ and ‘den- 
sity’ (cardinality/size), and w; weights each dimension according 
the problem at hand [4]. In this work, w; is used with the value of 1 
in every dimension. In Fig. 1, granules A and B produce the following 
values: 


d d=2 
Distanceyax = S “distances; = bme —(-1))=4; 
i=1 i=1 
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Data Set 


Fig. 1. Resulting Granule ‘C’. 


{max(0.9, 0.55) — min(0.4, —0.1)} 
2 


_ {max(0.2, —0.1) — min(—0.8, —0.2)} 1 
ij 2 = 


Distance, 3 = 


Ca.p = Cardinality, + Cardinalityg = 15 Granules; 


Cardinalityyax = Granule, + Granuleg + Granulep + Granules 
+Granuler + Granuley = 8+74+104+2+6 


+8 4+ 2 = 43 Granules 


Lap = 2; 
Lengthyax = 3.93; 
a= 0.35; 


C(A, B) = 4—1 x exp(—0.35 x 0.685) = 3.123 


Since the compatibility criterion in Equation (1) is based on the 
multidimensional length of each granule and its cardinality, the 
granular index decreases while the numbers of iterations increases, 
as less compatible granules are merged. Fig. 2, illustrates a typi- 
cal evolution of the compatibility measure. As expected, the index 
reduces dramatically (fall-off) which represents less compatible 
(dissimilar information) is merged towards the end of the gran- 
ulation process. This may be also used as a criterion to terminate 
the iterative process. 

For simplicity purposes just two dimensions are used to demon- 
strate the granulation concept. Fig. 3 illustrates the granular 
compression with a two-dimensional data set at various levels. The 
top left graph in Fig. 3 shows the initial raw data, and then the subse- 
quent graphs show how such data are merged to form the required 
number of final information granules. Moreover, Fig. 3, depicts the 
granular process divided into three main steps: (a) raw data; in this 
stage each datum is considered as a granule into the input space and 
hence compressed in compact and dense granules, (b) input space- 
data granulation; during this iterative process the initial number of 
granules is reduced according to their compatibility in which vari- 
ous similarity measures can be taken into account, such as: the size 
of granules, the cardinality, overlapping among granules, orienta- 
tion, etc., and finally (c) output space-density function represents 
the linguistic interpretation of the final group of dense granules 
which preserve the original features of the raw data. 

In the first stage of granulation the raw data are compressed in 
dense and compact granules. Without loss of generality, as it was 
explained in [8] the final geometrical boundaries of each informa- 
tion granule are used to estimate the initial value of C; and oj which 
are illustrated in Fig. 4. The average hyper-box boundaries of each 
granule are utilised to calculate the initial CG as follows: 


C= | Cie Geta|? Folate? (4) 


Where j is the jth hidden neuron and i the ith input. 


maxy; — miny; 
Gi = Xi 5 ‘Xi (5) 


32.2 


32 


w 
BN 


Compatibility max 2t iteration ’j’ 


Optimal number of clusters 


400 


800 1200 


iteration j 


Fig. 2. Compatibility measure example. 
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Fig. 3. Data granulation process. 
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Fig. 4. General structure of an RBF-NN. 


Here, the width of the Gaussian function in the RBF layer is 
estimated by [25] and hence its calculation is done via Equation 
(6): 
: ; 2 1/2 
a= =| 22 lle 6) 
k=1 


Where C;, is the nearest neighbours of centroid C;. 
2.1. Radial basis function neural networks 


Considering the equivalence between the Radial Basis Function 
Neural Networks (RBFNN) and Takagi-Sugeno type-O fuzzy sys- 
tems (or type-1 Mandani inference engine) as expressed in [13,14], 
an RBFNN combines the input-output n+1 dimensional space {x;, 
X2,.-+)Xj, +--+) Xp, Yp+1} where x; represents the input partition and 


yp+1 the corresponding output as is illustrated in Fig. 4. Due to this 
functional equivalence that lies on the fact [13]: 


(a) The number of the receptive fields (hidden layer) is equal to the 
number of if-then rules. 

(b) The output of each fuzzy if-then rule is composed of a constant. 

(c) The n+1 dimensional membership is chosen as Gaussian func- 
tion with the same variance. 

(d) The firing strength is based on the t-norm operator. 

(e) The RBFNN output is estimated on weighted average or 
weighted sum. 


The final number of granules (Fig. 3) obtained after granulation 
process are used as the A; initial fuzzy sets to construct the if-then 
rules as follows: 

Rj: If x1 is Ay and x3 is Ag, x is Aj, ..., and Xp is Ap 

Then y is z;; m=1, ..., P=number of hidden neurons 

Typically, z; can take any of the following three forms: 


(1) Singleton 
(2) Fuzzy set 
(3) Input-based polynomial function 


By assuming a one-to-one relationship between a multidimen- 
sional granule and a Fuzzy Logic linguistic rule of the form shown 
above, one can construct a whole rule-base using the informa- 
tion granules. The extra granular features (e.g. cardinality, centre 
of gravity) can be used to estimate an initial centre and width for 
the resulting Fuzzy sets of the rule-base as it will be shown over 
Section 3.1. 
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In this investigation, a system with a centre of gravity defuzzifi- 
cation will be used, product-inference rule, and a singleton output 
layer, which are expressed by: 


= P 
m > T= Mii 


j=l 


where /;; is the membership value of the input i which is in jth rule. 


7 (xj = cy) 
(8) 
Thus from (7) it can be written that: 


P 
y=) 785%) (9) 
j=l 


where g; is defined as: 


g(x) = 


me 
S mix) 
j=l 


where x is the input vector and m its corresponding membership 
evaluation which is defined as: 


= oF 2 
(x) = ell*-S/97) (11) 


Once the initial parameters cj and oj are estimated, their opti- 
misation can be carried out by using various algorithms including 
such as algorithms based on gradient descent (GD), or Evolutionary 
Algorithms (EA). In this research work an adaptive Back-Error- 
Propagation (BEP) algorithm is employed [13-15], which has been 
proven in the past to be very efficient in the optimisation of the 
proposed type of system. 

The update rule for the centre estimation: 


Xi - C(t, i) 


G(t +1, i) = yG(t, i) « eg;(y; — y) z (12) 
Jj 
The update rule for the width estimation: 
2 
So - Git) 
di(t +1) = ydj()— « egy —y) | —— (13) 
Jj 
The update rule for the output weight estimation: 
w(t + 1) = yw; (t) — agje (14) 


P 
where a: Learning rate; y: Momentum.e = y — y¢ = S ogi ‘VWi-Ye 
j=l 

e is the training error of the ith data point; Y; is jth output from 
the data; y is output from the model; t is tth training data point; i 
is i input to the neural network. 

Although, the BEP leads the objective function to a good local 
minimum by using a small learning rate, often it does not represent 
the optimal performance of the system due to the algorithm ‘getting 
stuck’ in local minima. In order to overcome this issue amomentum 
based term and a continuously adaptive version of BEP is used, as 
previously presented in [8]. 


3. Neutrosophic logic in granular computing 


The concept of Neutrosophic sets was introduced by F. Smaran- 
dache as a generalisation of fuzzy and intuitionistic logic [26] in 
order to deal with the truth (T), falsehood (F) and the indeter- 
minacy (I) of an event to happen. In fact, Smarandache used the 
concept of infinitesimals in order to define the non-standard [27] 
real subsets]~a, b*[. According to [26,27], a number x is said to be 
an infinitesimal if and only if for all positive numbers n the number 
x is defined as |x| <1/n. Moreover, the elements of a non-standard 
interval are ~a=a—x and b* =b+x. In Neutrosophic logic, its com- 
ponents are introduced by using a standard or non-standard unit 
interval as follows: 

Let T, I, F be standard or non-standard real subsets of]~—0, 1*[, 
with sup T=tsup, inf T= ting 

sup I=isup, inf I= ting 

sup F=fsup, inf F= fing 

where inf]~ a, b*[=~a and sup]~a, b*[ = b* 

The elements T, I, F can be interpreted as intervals, standard 
or non-standard real sets, discrete, continuous, single-finite sets, 
operations under intersection or union, fuzzy numbers, normal dis- 
tribution, etc. For this reason the tuple <t, i, f> [26] represents the 
truth value, indeterminacy value and falsehood value. In information 
theory [28] three basic types of uncertainty are defined, namely: 
(a) fuzziness: the lack of definite or sharp distinction, (b) ambi- 
guity which is divided into two types (b-1) strife: defined as the 
disagreement in choosing several alternatives and (b-2) nonspeci- 
ficity when two or more alternatives are left unspecified. Here, it is 
considered the lack of sharpness in the sense if two granules really 
belong to each other based on the local entropy produced by their 
merging. 

The granulation process, as described in [8,29], aims to com- 
press the initial data into compact and dense granules based on the 
resulting cardinality and the multidimensional length of any two 
merged granules. Moreover, this methodology exploits as much 
as possible the density (‘richness’ of information) of the granules. 
However, sometimes the introduction of a new granule in the input 
space produces a lack of distinguishability due to the overlapping 
which is not considered into the compatibility Equation (1). In order 
to address this issue we propose the introduction of granulation 
under a Neutrosophic scheme in order to estimate the uncertainty 
in the pattern space. The hypothesis is that if the granulation com- 
patibility index ‘favours’ the merging of granules that will lead to 
less accumulated uncertainty in the data set then the resulting 
multidimensional granules, hence Fuzzy Rules, will have less uncer- 
tainty leading to more robust Fuzzy Inference models. On one hand, 
in Neutrosophic terms granulation uncertainty or granulation inde- 
terminacy represents the ‘hesitation’ of two granules to belong to 
each other either as a single point or as an interval in the partition 
space. The interpretation of a granule in N dimensional space can be 
interpreted as a composition of N standard intervals]-0, L*[where 
L>0. On the other hand, in terms of fuzzy theory, fuzziness has 
been defined as a result from the lack of sharpness of relevant dis- 
tinctions, and it is different from ambiguity. In [30] the definition of 
uncertainty is based on a distance function with the same general 
view of fuzziness. 


f(A)= -Sva (x) log2A (x) + A(x)log2A(x) (15) 


xeX 


In [23,31,32] various definitions of uncertainty have been pro- 
posed; here we will employ that defined in [23]. 


ey = C+ pje"-Pi) + (1 — pe, Ce [0, 1] (16) 
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R? 


GranuleB 


a) Granular overlapping 


Up=1.0 


b) Granular Membership per dimension 


Fig. 5. Interval and numerical membership representation. 


Where p is the probability of the event j and Sop =1,0< 
j 

pj <1 

In order to justify the exponential function, we denote its gain in 
information corresponding to the occurrence of the ith event [23] 
and its properties as any entropy expression. 

P1: ey is defined at all points in [0,1] 

P2: Tent ee = Aey (pj = 0) = k1, k; => O and finite 


Pi> 

P3: ey (pj = 1) = kz, kp > O and finite 

PA: ko <ky 

P5: with increase in ey, Aey decreases exponentially. 

In other words with increase in the uncertainty the gain in infor- 
mation increases exponentially. In a similar approach to the one 
presented in Equation (16), in Fuzzy Set theory p; can represent the 
membership of an element jz; ¢ Uy where Uy is the set of fuzzy sets. 
The aims of granulation in the proposed framework are to (a) define 
the linguistic scenario [8] under the compatibility criterion and (b) 
define its potential application in high dimensional variable space. 
From a GrC perspective, the interpretation of the membership in 
R2 can be considered as shown in Fig. 5. 


n 


Here p; = Hj) _ Me n=all the granules k that overlap to granule 


k 
j, and ju; is defined as 


ANB ; 
Lp = B= | sup (B) — inf (B)| (17) 
Referring back to the above equation, the tuple <t, i, f> can be 
individually interpreted as 


Therefore, equation (16) can be rewritten as 
i; = C + Geli + fet, Ce]-0, 1+] (19) 


Fig. 6 shows plot of the evaluation of granular indeterminacy in 
terms of t and f. It takes its maximum value imax when the overlap- 
ping in granulation is equal to 0.5 and then t=f. In other words, i; 
depicts the indeterminacy produced by merging two granules A and 
B and how this (union) affects the granular neighbourhood. More- 
over, the indeterminacy between two granules A and B is expressed 
by: 

1 d 
iqug = pi d = number of dimensions (20) 
j=l 


0.6 |- 


0.45 


Indeterminacy (units) 


1 1 1 


0 L 1 L L 
0 0.1 0.2 0.3 0.4 


1 1 
5 0.5 0.6 0.7 0.8 0.9 1 
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Fig. 6. Indeterminacy value. 
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AUB If x: is A and xz is B then y is z1 
If x: is A and x; is C th i 
a) => x: is A and x; is en y is z1 
i i If x: is A and x2 is B then y is z1 
b) BUC If x: is A and x: is C then y is z2 


Fig. 7. A granulation step using a distance similarity measure and indeterminacy respectively. 


Finally, the granular compatibility criterion, Equation (1), can be 


uncertainty/indeterminacy could be evaluated by using the tuple 
written as 


<t, i, f> as a histogram of such components as follows: 


C(A, B) = Distancemax — [iqug + Distance, g x exp(—a@ x R)] (21) 


1 
i(t) — FO) y j 
Ni(t) = dx Pal” x i(t) (22) 
The compatibility criterion, as described by Equation (21), will 
allow the merging of the most compatible granules while at the 
same time the granular uncertainty is taken into consideration. Where: 
Equation (21) is a minimisation cost function; hence the gran- d: number of dimensions. 
ulation will follow the ‘path’ of the minimum uncertainty. The card: the resulting cardinality of the new merged granule. 
‘disorder’ produced during the granulation process in terms of t: iteration t 
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b) Granular Computing using the uncertainty estimation 


Fig. 8. Sample granulation scenario. 


4017 


A.R. Solis, G. Panoutsos / Applied Soft Computing 13 (2013) 4010-4021 


w 
— 


l 


2 lo 
sojdwes jo ‘ou 


Charpy Toughness, joules 


a 
sojdwes jo ‘ou 


Charpy Toughness, joules 


We 


ie) o ito} ot 
Es 


= 


sejdwes jo ‘ou 


aa 


i 


Test Depth, mm 


Charpy Toughness 


o vt N Oo 
sojdwes jo ‘ou 


Specimen Size, mm 


joules 


Charpy toughness 


Fig. 9. Sample of Charpy Toughness data space [6]. 


there are some situations, in which such a density and distance 
measures do not produce the best orientation and distribution 


3.1. Uncertainty in the linguistic scenario and granular 


information ‘coverage’ 


of the new merged granules. More specifically, this can repre- 


sent a loss of transparency in the final linguistic rules, and their 
characterisation. For instance, in Fig. 7a the two final granules 
produce a misinterpretation of the consequence in the linguistic 


Taken in its broad sense, granulation iterative methodology 


described in [8] considers the proximity between any two entities 
and its cardinality and length as a compatibility measure. However, 
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Fig. 10. Initial Membership functions by using Granular Compression. 


scenario, and hence this composition bears a lack of a parsimo- 
nious modelling. The AUB resulting granule covers an area (lower 
left of the granule) where raw data - information — simply does not 
exist despite following the compatibility objective function. One of 
the motivations to include the uncertainty under this merging pro- 
cess is to eliminate as muchas possible this undesirable granulation 
behaviour, and promote a better granular coverage under a Neu- 
trosophic scheme, where the granules are strongly linked with the 
raw data/information. 

As a further example, Fig. 8 illustrates the final granules con- 
structed after applying just the granulation process indicated in 
Equation (1), and those obtained with the expression shown in 
Equation (21). The resulting final granules in the second scenario 
include more granular information on overlapping; however the 
individual granules represent more accurately the underlying sub- 
granules/data. 


4. Case study and simulation results 


In this section, a case study is provided to evaluate the effective- 
ness of the proposed method for improving the interpretability of 
the input space partitioning while preserving the global accuracy. 
The example consists of a data set related to the impact energy tests 
(Charpy Toughness) of heat-treated C-Mn grade steel. 


4.1. Impact energy 


One of the most popular and standardized impact techniques 
is the Charpy test, which is a toughness measurement, and it is 
usually employed in the industry [33]. This technique is used to 


Table 1 
Charpy toughness: input variables. 


Chemical composition Test parameters Heat treatment 


C, Si, Mn, S, Cr, Mo, Ni, Al, V Test depth, Hardening 
Specimen size Test temperature 
site 


Test temperature Cooling medium 
Tempering 


temperature 


ascertain the fracture characteristics of materials; it is mainly used 
to estimate the impact energy (Joules) of a standard size/shape bar 
of square cross section during its fracture by another standard type 
of cantilever equipment. 

The load is applied as an impact blow froma weighted pendulum 
hammer, which is released from a specific height. The specimen is 
placed ona base and suddenly hit by the pendulum that fractures it. 
Due to its low repeatability, high cost, and imprecise and scattered 
results obtained under the same input conditions this test turns out 
within a certain output space region but with high uncertainty and 
variability [33] inherited in the measurements. 

The Charpy toughness data set used in this work consists of 1661 
measurements on heat-treated steel [31] (TATA Steel, Yorkshire). 
The data set has 16 input dimensions, and 1 output (Impact Energy, 
Joules), the scarcity of some of the data dimensions is illustrated in 
Table 1. 

For cross-validation purposes the data have been split into train- 
ing, checking and testing, in order to avoid over-fitting the model 
enhancing its generalisation properties. The initial data used to 
train the GrC-NF model consists of 1084 (65%), which are composed 
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Table 2 
RMSE performance of 3 similar NF structures using Fuzzy C-means and granular 
computing. 


Training Checking Testing 
NF with Fuzzy C-means 18.78 20.18 21.78 
GrC-NF [8] 14.66 21.24 20.42 
Neutrosophic GrC-NF 16.10 18.3 19.34 


of just raw data. The checking data and testing data are 277 (17%) 
and 300 (18%) respectively. 

Unlike the methodology described in [8] the term igug is 
introduced to estimate the indeterminacy produced by the over- 
lapping created in each dimension considering just intervals or 
simply the corresponding face of a granule. Once, the final com- 
pression is obtained this information is captured by the GrC-NF in 
order to train the RBF Neural-Fuzzy structure. In Table 2, it is shown 
a comparison of two previously obtained results via other granula- 
tion methods and those obtained with the proposed indeterminacy 
estimation. Even though, in [8] the training performance is better, 
the testing and checking simulation turned out more robust bear- 
ing an enhanced generalisation (testing) which is very significant 
for this type of industrial data. 

As it was also suggested in [34], here a simulation with less than 
6 or more than 18 granules is not considered due to the poten- 
tial over-fitting behaviour or under-representation of the raw data 
occurs during the training stage. In this sense, an RMSE index to 
measure the performance during the training, checking and testing 
stage by using an initial partition space of 9 granules is suggested. 
Table 2 indicates a comparison between the granular comput- 
ing with uncertainty and those mentioned in [8] and the results 
obtained by means Fuzzy C-means. 

The initial membership functions obtained by just considering 
granular compression are shown in Fig. 9. There, the overlapping 
caused by the merging stage is significant and this produces a lack 
of sharpness in the distinction of rules. In this sense, we try to atten- 
uate this indeterminacy by persuading the combination of granules 
under a low overlapping scheme. Moreover, we believe that a bet- 
ter understanding in granular uncertainty must be studied taken 
into account its compatibility features (Fig. 10). 

The membership functions obtained by granular computing 
and uncertainty in Fig. 11 offer a better distinguishability as a 
result of identifying measuring the uncertainty produce during the 
granulation and how this value is used. However, this does not 
represent the best partition; since it is clear there are some input 
regions with a lack of good interpretability. Fig. 12, illustrates the 
measured-predicted fitness plots as obtained via the GrC-NF Neu- 
trosophic approach. Table 2 demonstrates the performance of (a) 
a NF model created via a clustering framework based on the well- 
known Fuzzy C-Means algorithm [35,36] (b) AGrC-NF model which 
is based on the results obtained in [8], and (c) GrC-NF with the 
neutrosophic tuple <t, i, f>. From the results shown in Table 2 one 
can notice the improved checking performance and most impor- 
tantly the improved testing performance (generalisation). Finally, 
Figs. 13 and 14 show the Neutrosophic index evaluation and the 
RMSE progression during the optimisation procedure respectively. 
The ‘path’ followed by the Neutrosophic GrC-NF model is slower 
convergent however it reaches to a better performing final model, 
possibly by the inclusion of higher quality information granules 
(rules) which have less inherent uncertainty. The neutrosophic 
index evaluation plot shows that during the end-stage of the opti- 
misation routine the resulting indeterminacy of the final granules 
(rules) is less on the case of the GrC-NF Neutrosophic model. In 
short, Fig. 13 represents the distribution order of the final granules 
in the partition space. 
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Fig. 12. Charpy Toughness model fit. 
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Fig. 14. Root mean square error performance using <t, i, f>. 


This index reflects the behaviour of the compatibility expression 
in terms of <t, i, f> and the final distribution of the resulting granules. 


5. Conclusions 


In this paper, a systematic modelling framework based on 
Granular Computing, Neural-Fuzzy modelling and Neutrosophic 
Logic is proposed. The presented approach mimics the human 
cognition in terms of grouping similar information (granules) 
together based on a number of similarity measures (in the com- 
putational case: proximity, cardinality, length). Furthermore, the 
proposed approach uses a Neutrosophic Logic concept to esti- 
mate inherent information uncertainty/indeterminacy due to the 
merging operation during the information granulation process. 
The uncertainty/indeterminacy index, calculated via a Shannon 
entropy criterion, is iteratively calculated during the granulation 
process and this results in a final GrC-NF inference system with 
a more robust rule-base with better representation of the raw 
data/information. This approach is applied to a real industrial 
dataset, based on the measurement of the Charpy Toughness of 
heat-treated steel, a process that is particularly known for the pro- 
duction of sparse and uncertain data. The proposed methodology 
is successfully applied to the industrial dataset and the results 
show an improved generalisation and model interpretability per- 
formance as compared with similar previous modelling attempts 


based on the exact same dataset. However, further investiga- 
tions need to be performed with different datasets and synthetic 
benchmarks to establish the overall performance of the proposed 
approach under different information granulation conditions. It 
would also be interesting to use this new framework presented 
in this paper and try to introduce different levels and measures of 
information uncertainty, for instance that uncertainty caused by 
conflict, congruency, discord, and ambiguity in terms of nonspeci- 
ficity, particularly variety and diversity in the creation of granules. 
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